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Abstract — An important monitoring task for power systems 
is accurate estimation of the system operation state. Under 
the nonlinear AC power flow model, the state estimation (SE) 
problem is inherently nonconvex giving rise to many local optima. 
In addition to nonconvexity, SE is challenged by data integrity 
and cyber-security issues. Unfortunately, existing robust (R-) 
SE schemes employed routinely in practice rely on iterative 
solvers, which are sensitive to initialization and cannot ensure 
global optimality. A novel R-SE approach is formulated here by 
capitalizing on the sparsity of an overcomplete outlier vector 
model. Observability and identifiability issues of this model are 
investigated, and neat links are established between R-SE and 
error control coding. The convex semidefinite relaxation (SDR) 
technique is further pursued to render the nonconvex R-SE prob- 
lem efficiently solvable. The resultant algorithm markedly out- 
performs existing iterative alternatives, as corroborated through 
numerical tests on the standard IEEE 30-bus system. 

Index Terms — Power system state estimation, robustness, spar- 
sity, system identifiability, semidefinite relaxation. 

I. INTRODUCTION 

The electric power grid is a complex cyber-physical system 
consisting of multiple modules, each with a transmission 
infrastructure spanning over a huge geographical area, trans- 
porting energy from generation sites to distribution networks. 
Monitoring the operational conditions of grid transmission 
networks is of paramount importance to facilitate system 
control and optimization tasks, including security analysis and 
economic dispatch with security constraints; see e.g., 0] Ch. 1] 
and 1161 . For this purpose, various system variables are mea- 
sured in distant buses and then transmitted to the control center 
for estimating the system state variables, namely complex bus 
voltages. Due to the wide spread of transmission networks and 
the current integration of enhanced computer/communication 
infrastructure, the power system state estimation (SE) is chal- 
lenged by data integrity concerns arising due to "anomalous" 
measurements affected by outliers lfl2l . fl20l and/or adversarial 
cyber-attacks (U, 0"3l , |[T4l . These concerns motivate the 
development of robust approaches to improve resilience of SE 
to anomalous (a.k.a. bad) data. 

For the AC power flow model however, SE challenges come 
not only from anomalous data, but are further magnified due to 
the nonlinear couplings present between meter measurements 
and state variables. To cope with these challenges, Gauss- 
Newton iterative solvers estimate the state of an approximate 
linear regression model per iteration, using robust renditions 
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of the weighted least-squares (WLS) error criterion, such 
as the weighted least-absolute value (WLAV) one; see e.g., 
0] Ch. 2-6]. The current iteration adopts Taylor's first-order 
expansion around the estimate of the previous iteration to 
approximate the quadratic AC model with the aforementioned 
linear regression model. This iterative procedure is closely 
related to gradient descent algorithms for solving nonconvex 
problems, see e.g., J5] Ch. 1], which are known to encounter 
two issues: i) sensitivity to initialization; and ii) convergence 
concerns. Existing variants have asserted improved numerical 
stability of the iterative procedure 0] Ch. 3]. Latest SE trends 
incorporate linear state measurements offered by synchronized 
phasor measurement units (PMUs); see e.g., iflOl and refer- 
ences therein. However, limited PMU deployment currently 
confines SE to mostly rely on the traditional nonlinear me- 
ter measurements, and its companion Gauss-Newton iterative 
methods. Hence, it is very important to develop a robust (R-) 
SE solver tailored for the nonlinear measurement model, and 
capable of approximating the global optimum at polynomial 
complexity. 

The present paper adopts an overcomplete additive outlier- 
aware measurement model, and leverages the sparsity of out- 
liers to develop an R-SE approach to jointly estimate system 
states and identify the outliers present (Section HUi. 

Inherent to the overcomplete outlier-aware model is system 
under-determinacy, which in turn raises outlier observability 
and identifiability concerns (SectionlHlli. It was recently recog- 
nized that there are unobservable cyber-attacks that the system 
operator would fail to detect J4], lfT3l . Ifl4l . but all studies so 
far are limited to linear approximate SE models. Theoretical 
guarantees of the sparse outlier model were explored for 
nonlinear SE models in lEUl . Compared to these works, the 
present one provides a unifying framework to understand how 
tolerant the nonlinear regression model is to data corruption, 
by introducing the notion of measurement distance. The latter 
is nicely connected to distance metrics popular in channel 
coding theory, which are known to determine the error-control 
capability of channel codes. This connection reveals why the 
measurement distance is instrumental to characterizing the 
regression function's resilience to outliers. 

In addition, the novel R-SE framework lends itself to a con- 
vex relaxation approach, which yields R-SE solvers approxi- 
mating the global optimum (Section IIVb . A well appreciated 
tool for convexifying non-convex problems 05j, semidefinite 
relaxation (SDR) solvers thus emerge as powerful schemes 
for R-SE of nonlinear AC power flow models. Preliminary 
tests on the IEEE 30-bus system corroborate the performance 



improvement of the proposed approach (Section [V}. 

Notation: Upper (lower) boldface letters will be used for 
matrices (column vectors); (-) T denotes transposition; (-) u 
complex-conjugate transposition; Re(-) the real part; Im( ) the 
imaginary part; Tr(-) the matrix trace; rank(-) the matrix rank; 

the all-zero matrix; || • || p the vector p— norm for p > 1; |_-J 
the floor of a real number; and | • | (Z) the magnitude (angle) 
of a complex number. 

II. Modeling and Problem Statement 

Consider a power transmission network with N buses 
denoted by the set of nodes AT := {1,...,N}, and L 
transmission lines represented by the set of edges £ := 
{(n, m)} C J\f x J\f. Suppose M measurements are taken 
for estimating the complex voltage states {V n } n eJ\f, from a 
subset of the following system variables: 

• Pn{Qn)- the real (reactive) power injection at bus n 
(negative if bus n is connected to a load); 

• Pmn(Qmn)- the real (reactive) power flow from bus m 
to bus n; and 

• |Vfj|: the voltage magnitude at bus n. 

Compliant with the AC power flow model [2 J, these 
measurements obey nonlinear equations relating them with 
the system state vector v := [Vi , . . . , Vn] t £ C N . These 
equations also involve the injected currents of all buses that 
are here collected in the vector i := [ii,... ,In] T €E C N , as 
well as the currents, flowing from say bus m to n, denoted 
by Imn- Kirchoff's law in vector-matrix form simply dictates 

1 = Yv, where Y E c WxAr denotes the grid's symmetric bus 
admittance matrix having (m, n)-th entry given by 

{-y mn , if (m,n) e £ 

ynn + Y, V £N n Vnv, if m = 71 (1) 
0, otherwise 

with y mn denoting the line admittance between buses m and 
TV, Vnn bus n's admittance to the ground; and Af n the set of all 
buses linked to bus n through transmission lines. In addition, 
the current flow is given by I mn = y mn V m + y mn (V m - V n ), 
with y mn standing for the shunt admittance at bus m associ- 
ated with line (m, n). Clearly, all current variables are linearly 
related to the state v. As for the nonlinear measurements, the 
AC power flow model asserts that the apparent power injection 
into bus n is given by P n + jQ n = V n T^, while the apparent 
power flow from bus m to bus n by P mn + jQ mn = V m I%~ n . 
Further, expressing the squared bus voltage magnitude as 
| V^, | 2 = V n V^, it is clear that all measurable quantities listed 
earlier are nonlinearly (in fact quadratically) related to v. 

Apart from the nonlinearity present, another challenge 
present in the SE is due to grossly corrupted meter measure- 
ments (a.k.a. bad data). Statistical tests such as the largest 
normalized residuals of the weighted least-squares (WLS) 
estimation error are typically employed to reveal and remove 
bad data fl6l . Alternatively, robust estimators, such as the 
least-absolute deviation, or Huber's M-estimators have also 
been considered; see e.g., 0] Ch. 6], Motivated by recent 



advances in sparsity-aware robust statistical inference ||9), the 
fresh look at robust SE (R-SE) advocated in this paper is 
by an overcomplete model for the outlying data. To this 
end, collect first the M measurements in the vector z := 

[{Pn}n€Af P , {Qn} 

{|Ki| 2 }neAfi/] T ' where the check mark differentiates measured 
values from the noise-free variables^. Consider also the scalar 
variables {at}fL l one per measurement, taking the value 
a? = if the ^-th measurement obeys the nominal (outlier- 
free) model, and ag ^ if it corresponds to a bad datum. 
This way, the nonlinear measurement model becomes 

zi = M v ) +U + a t , i=l,...,M (2) 

where hg(-) captures the quadratic relationship specified by 
the aforementioned AC power flow equations, and the zero- 
mean additive Gaussian white noise (AWGN) ei is assumed 
uncorrected across meters with variance aj. 

Recovering both v and the Mxl vector a := [oi , . . . , aif] T 
essentially reveals the state and identifies faulty measurements. 
However, the system in (fJJ with both v and a being unknown 
is under-determined, as the number of measurements M is 
always less than the number of unknowns N+M. Instrumental 
to handling this under-determinacy will be the (arguably low) 
percentage of outliers, which gives rise to a (high) level of 
sparsity, that is the number of zero entries in a. The degree 
of sparsity will be further linked in the ensuing section with 
the notions of observability and identifiability of the outlier 
vector. By capitalizing on the sparsity of a, the goal of jointly 
estimating and identifying v and a can be achieved by the 
following outlier-sparsity-controlling criterion 

M 

{v, a} :=argminy^u^ [z e - h e (v) - a e ] 2 + A||a|| (3) 

v,a * — ' 

1=1 

where W£ := 1/cf W, and A > scales the regularization 
term which comprises the £o-P seu donorm, i.e., the number of 
non-zero a/s that naturally controls the number of outliers in 
a. Even with linear models however, solving the optimization 
problem in © is NP-hard due to the ^o- norm regulariza- 
tion J6J. Before proposing efficient schemes for solving the 
under-determined problem in (f3), the next section will provide 
observability and identifiability analysis to assess the ability 
of R-SE to cope with sparse outlier patterns. 

III. Outlier Observability and Identifiability 

The goal of this section is to investigate fundamen- 
tal uniqueness issues associated with the system under- 
determinacy arising due to the overcomplete outlier-aware 
model in To isolate uniqueness from noise resilience 
issues, focus is placed on the noise-free outlier-aware mea- 
surement model written in vector form as 

z = h(v) + a (4) 

1 For consistency with other measurements, | V n \ 2 is considered from now 
on. This is possible by adopting \V n \ = \V n \ + ey, where ty is zero- 
mean Gaussian with small variance a v , to obtain the approximate model 
IV^I 2 \V„\ 2 + e' v , where e' v has variance 4\V n \ 2 o v . 



with the high-dimensional function h(-) : C N — > R M . 

Definition 1. Given measurements z = h(v D ) + a G , with v D 
denoting the true state, and h(-) known, the outlier vector a Q 
is observable if and only if ( iff) V v G the set 

V := {v e C N \ z = h(v ) + a = h(v)} (5) 

is empty. Furthermore, the outlier vector a G is identifiable iff 
V v G the set 

S :={(v,a)| h(v) + a = z, ||a|| < ||a o || } (6) 
has only one element, namely (v ,a ). 

But why are outlier observability and identifiability intu- 
itively important? For an observable a D , upon collecting z, 
the system operator can discern whether there are bad data or 
not. In addition, for an identifiable a D , the system operator can 
recover exactly (in the absence of nominal noise) both a D and 
v Q in the presence of bad data. 

Definition Q] implies that if a D is identifiable, then it is 
necessarily observable, because otherwise the set V in © 
would have at least one element v' 6 C N ; in which case, the 
pair (v', 0) would be an additional second element of S in © 
- a fact contradicting identifiability. Therefore, as a property 
of an outlier vector a c identifiability is stronger than (i.e., 
subsumes) its observability. 

Without accounting for the nominal AWGN in (0, it is 
possible to reduce the cost in ([3]l to only the £o- norm i while 
including the quadratic part as equality constraint to obtain 

{v, a}:=arg min ||a|| 

h(v)+a— z 

= arg min ||z-h(v)|| . (7) 

v,a— z— h(v) 

Clearly, for the noise-free R-SE problem in (|7), the pair 
(v ,a ) is feasible, and the cost evaluated at (v ,a ) equals 
|z — h(v )||o = |ja ||o- This is also the minimum cost 
attainable when a G is identifiable, as there is no other pair 
(v,a) with smaller £o-norm ||a||o according to Definition [TJ 
Conversely, if the noise-free problem (|7) has a unique solution 
given by (v ,a ), then a D is identifiable. Similarly, the noise- 
free R-SE formulation can easily detect the presence of bad 
data if the minimum achievable is non-zero. This clearly 
demonstrates the role of the outlier vector's £ -norm in the 
R-SE criterion (0, in identifying the presence of bad data, or, 
in recovering the true state even when bad data are present. 

A critical attribute for an observable (identifiable) outlier 
vector is its maximum sparsity level K (respectively Ki). 
To appreciate this, consider the two broad classes that outliers 
typically come from. The first class includes bad data emerging 
due to faulty meters, telemetry errors, or software bugs, which 
generally occur rarely, that is with low probability; see e.g., 
J9] and references therein. Here, K a quantifies the maximum 
number of bad data that can be revealed with high probability; 
while Ki denotes the maximum number of outlying meters that 
can be identified so that recovery of the true state becomes 
feasible. The second source of outliers comprises malicious 



data attacks, in which the adversary can typically control only 
a subset of meters with limited cardinality J4], ||T3l . 1(141 . In 
this class of outliers, K Q and Ki can suggest the minimum 
number of meters that must be protected to render malicious 
data attacks ineffective. 

Even though K Q (Ki) is useful for assessing the degree of 
outlier observability (identifiability), deciding whether a given 
vector a D is observable or identifiable for the nonlinear AC 
model (HJi is challenging, except for the trivial case a = 0. 
Fortunately, it is possible to obtain K a and Ki by leveraging 
the notion of the measurement distance for any nonlinear 
function h(-), as defined next. 

Definition 2. The measurement distance for the function h(-) : 

C N R M is given by 

D(h) := min||h(v)-h(v')||o 

M 

= ninVl[/i<(v)-^(v')] (8) 

v=6v' — ' 
1=1 

where 1 denotes the indicator function. 

The notion of measurement distance parallels that of the 
Hamming distance in channel coding theory; see e.g., JU Sec. 
7.11]. Given any linear mapping over a known finite field, 
the Hamming distance characterizes the minimum difference 
between any two strings that lie in the mapped space, and 
it can be easily computed for fixed problem dimensions. 
However, for the R-SE problem of interest, v is drawn from 
the complex field C , while the mapping h(-) is quadratic. 
Compared to the Hamming distance it will be generally very 
challenging to compute D(h) in dS). 

Interestingly, as the Hamming distance has been popular 
due to its connection with the error control capability of linear 
channel codes, the measurement distance in (© will turn out 
to be particularly handy in characterizing outlier observability 
and identifiability, as asserted in the following proposition. 

Proposition 1. Given the measurement distance D for the 
nonlinear function h(-) in (|8]l, the maximum sparsity level 
of an observable outlier vector is K Q = D — 1, while the 
maximum one of an identifiable outlier vector is Ki = [ ' P ~ 1 j. 

The proof for both statements follows readily from Def- 
inition [U using simple contradiction arguments, and for this 
reason it is omitted. Notice that the second part can also be 
deduced after adapting lEUl Thm. 5.1], which neither explicitly 
relates to the notion of measurement distance, nor it is linked 
with the maximum sparsity level of observable outliers. 

Using the measurement distance metric, Proposition Q] pro- 
vides a unifying framework to understand the tolerance of 
any function h(-) to the number of outlying data. Since the 
measurement distance of any nonlinear function is difficult to 
obtain, the ensuing subsection pursues linearized approximants 
of the quadratic measurement model, which are typically 
employed by Gauss-Newton iterative SE solvers, and can be 
used to provide surrogate distance metrics. Depending on 
initialization, the linear approximants could not only be very 



accurate, but will also shed light on understanding uniqueness 
issues associated with nonlinear AC power system models. 

A. Linear Approximation Model 

Consider linearizing the nonlinear measurement model (3) 
expressed in terms of the polar coordinates of the state vector, 
as in e.g., (U Sec. 2.6]. Toward this end, the N x 1 complex 
vector v is mapped first to the 2N x 1 real vector x := 
[\Vi\,...,\V N \,lVi,...,lV N ] e R2N - Invokin g the first- 
order Taylor expansion, the noise-free z can be approximated 
around a given point v, or the corresponding x, by 

z = h(v)+a«h(v) + H 2 (x-x) + a (9) 

where H 2 £ K Mx ( 2A denotes the Jacobian matrix evaluated 
at x. Upon defining z := z — h(v) + H 2 x, the approximate 
model (O becomes a linear one in the unknown x, that is 

z«H 2 x + a. (10) 

The measurement distance of the linear function in (fTOb can 
be found easily, as summarized next. 

Proposition 2. The measurement distance for any linear 
mapping characterized by a full column-rank matrix H 2 € 
r mx(2N) is D = M+ i_ r ank(H 2 ). 

The proof relies on simple linear algebra arguments as 
follows. Using Definition 12 the measurement distance D := 
min x _ x /^o ||H 2 (x — x')||o is attained when matrix H 2 has at 
most (M—D) linearly dependent rows; otherwise, the number 
of zero entries of H 2 (x — x') would be [M — D + 1) and 
that of non-zero ones (D — 1), which leads to a contradiction; 
hence, rank(H 2 )= M — D + 1, as asserted by Proposition [2] 

Recalling from Proposition Q] how D is linked with the out- 
lier observability and identifiability levels, the next corollary 
follows readily. 

Corollary 1. For any linear mapping characterized by H 2 , 
the maximum sparsity level of an observable outlier is K Q = 
M— rank(H 2 ), while the maximum sparsity level of an iden- 
tifiable outlier is Ki = [ — ral ^ k ( H:c ) j_ 

For the linear approximation model in (Hot , the measure- 
ment distance D grows linearly with the number of meters 
M. This demonstrates that measurement redundancy is very 
beneficial for improving resilience to outliers. Conceivably, 
D could be further boosted thanks to the nonlinearity in h( ). 
Compared to its linear counterpart, the quadratic function h(-) 
is likely to increase the dimension of the space that is mapped 
to, and thus lead to a larger measurement distance in a space of 
higher dimensionality. This is precisely the reason why highly 
nonlinear functions find important applications to cryptogra- 
phy 0. Although linearization provides a viable approximant, 
quantifying (or bounding) the measurement distance for the 
quadratic h( ) corresponding to the AC power flow model 
constitutes an interesting future research direction. 



IV. Solving the R-SE via SDR 

This section will leverage convex relaxation techniques to 
solve the R-SE problem in (01. First, building on the premise 
of compressive sampling (6), the ^i-norm can be employed to 
tackle the NP-hard ^o-norm an d relax the R-SE cost in © to 

M 

{v,a} :=argminy^ w e [z e - he(v) - ag} 2 + A||a||i. (11) 

v,a — ' 

l=\ 

The I'l-norm relaxation has been used for various robust statis- 
tical inference tasks, with documented theoretical guarantees; 
see e.g., J9). It has also been adopted recently for R-SE 
with the linear measurement model in |[T2l . and also with 
the nonlinear one in ll20l using the linearization technique 
employed by iterative Gauss-Newton SE solvers. However, 
the pertinent performance analysis in 11201 has been given 
for a general nonlinear regression model, but not for the 
specific quadratic measurement model corresponding to AC 
power systems. Moreover, it is worth stressing that the iterative 
optimization framework adapted in ||201 to solve (QT} offers 
no guarantees regarding convergence or global optimality. 
In a nutshell, the desiderata remains to develop an R-SE 
solver capable of accounting for the practical AC quadratic 
measurement model, while attaining or approximating the 
global optimum at polynomial-time complexity. 

This task will be pursued here using semidefinite relaxation 
(SDR), which has been recently recognized as a powerful 
technique for convexifying the SE with nonlinear measurement 
models 11221 . To this end, each quadratic measurement z& will 
be expressed linearly in terms of the outer-product matrix 
V := vv". Let {e n }* =1 denote the canonical basis of M. N , 
and define the following admittance-related matrices 

Y„ := e„e^Y (12a) 

~^mn ■ — (y?rm H~ ymn)^m^ m ymn&m& n (1 2b) 

and their related Hermitian counterparts 

H Fi „:=l(Y„ + Y«), Hq,„:=|(Y„-Y£) (13a) 
Hp,mn : ~ 2 (^tnn + Y TOn ) , HQ j?Tm := — (Y mn — Y m7l ) 

(13b) 

VLv,n ■= e„e^. (13c) 

Using these definitions, the following lemma is proved in El 
to establish a linear model in the complex rank-one matrix V. 

Lemma 1. All error-free measurement variables are linearly 
related with the outer-product V as 

P n = Tr(H P ,„V), Q n = Tr(H Q , n V) (14a) 
P mn = Tr(H P , mn V), Q mn = Tr(H Q , mn V) (14b) 
\V n \ 2 =Tr(LLy, n V). (14c) 

Thus, the measurement Zi in (O can be written as 

z e = h e (v) + e t + a e = Tr(H^V) + e e + a e (15) 



where Hg is a Hermitian matrix specified in accordance with 
(fT3ab-(fT3cT>. 



Lemma Q] implies the following equivalent reformulation of 

CD [cf. O] 



M 



|Vi,ai| := argmin^wi [zg - Tr(H^V) - ag} 2 + A||a||i 
s.to V e C WxAr h 0,and rank(V) = 1 



(16a) 
(16b) 



where the positive semi-definiteness and rank constraints 
jointly ensure that for any V admissible to d!6bt . there always 
exists a state vector v G C N such that V = vv w . 

Albeit the linearity between zg and V in the new formula- 
tion ( TTBT l. nonconvexity is still present in two aspects: i) the 
cost in d 16ab has degree 4 wrt the entries of V; and ii) the rank 
constraint in d!6bb is nonconvex. Aiming for a semidefinite 
programming (SDP) formulation of (fl~6), Schur's complement 
lemma, see e.g., J5] Appx. 5.5], can be leveraged to convert 
the summands in ( 1 16ab to a linear cost over an auxiliary vector 

x e 



Specifically, with w 



[Wl, 



,wl\ t and likewise 
for x, consider an R-SE reformulation as 

jv 2 ,a 2 ,x 2 } := arg min w T x + A||a||i 

s.to V t 0,and rank(V) = 1, 

Xi zt - Tr(H,V) 



at 



zg - Tr(HfV) - ag 



(17a) 

(17b) 

r< W. 
(17c) 



Upon adapting results from J22J, the equivalence among all 
three R-SE formulations can be asserted as follows. 

Proposition 3. For the AC power flow model, all three non- 
convex formulations in (II U . d 1 6b . and (1171 ), solve an equivalent 
R-SE problem. For the optima of these problems, it holds that 

1 2 



Vi = V 2 = vv w and % 2 ,£ = 



zg - Tr(H^V 2 



W. (18) 



Proposition [3] establishes the relevance of the novel R-SE 
formulation ( TPTl i. which is still nonconvex though, due to 
the rank-1 constraint. Fortunately though (TTTt is amenable 
to the SDR technique, which amounts to dropping the rank 
constraint and has well-appreciated merits as an optimization 
tool; see e.g., Ifl5l for a tutorial treatment of its applications 
in signal processing and communications. The contribution 
here consists in permeating the benefits of this powerful 
optimization tool to estimating the state of AC power systems, 
even when outliers (bad data or cyber-attacks) are present. 

In the spirit of SDR, relaxing the rank constraint in (117b) 
leads to the following SDP formulation: 



{V,a,x} 



s.to v y o, 



arg min w x + A||a||i 



-XI 
z t - Tr(H,V) 



zg - Tr(H £ V) 

-1 



a i 



(19a) 

(19b) 

r< W. 
(19c) 



SDR endows R-SE with a convex SDP formulation 
for which efficient schemes are available to obtain the 
global optimum using, e.g., the interior-point solver Se- 
DuMi ff8l . The worst-case complexity of this SDP solver is 
C>(Af 4 \/iVTog(l/e)) for a given solution accuracy e > IfTSI . 
For typical power networks, M is in the order of N, and thus 
the worst-case complexity becomes 0(N 4 - 5 log(l/e)). Further 
computational complexity reduction is possible by exploiting 
the sparsity, and the so-called "chordal" data structure of 
matrix V, as detailed in l22l . 

Nonetheless, the SDP problem $1% is only a relaxed version 
of the equivalent R-SE in ( fTTI ); hence, its solution V may 
have rank greater than 1, which makes it necessary to recover 
a feasible estimate v from V. This is possible by eigen- 
decomposing V = J2l=i AjU^u^, where r := rank(V), 
Ai > • ■ ■ > A r > denote the positive ordered eigenvalues, 
and {ui G C are the corresponding eigenvectors. Since 

the best (in the minimum-norm sense) rank-one approximation 
of V is AiUiU^, the state estimate can be chosen equal to 
v(ui) := VAiUi. 

Besides this eigenvector approach, randomization offers 
another way to extract an approximate R-SE vector from 
V, with quantifiable approximation accuracy; see e.g., 03]. 
The basic idea is to generate multiple Gaussian distributed 
random vectors v ~ CAT(0, V), and pick the one with the 
minimum error cost corresponding to the set of inlier meters 
Mi := {£\l < t < M, ag ^ 0}. Note that although any 
vector v is feasible for (fTTT i, it is still possible to decrease the 
minimum achievable cost by rescaling to obtain v(i/) = cis, 
where the optimal weight can be chosen as the solution of the 
following convex problem as 



arg mm wg {zg 



oo 



c 2 v u YL,v\ 



eeM, 



HieMi wgzgi/ H Hgv 



T,eeMi wg^Ugu] 



2 ' 



(20) 



It will be of interest to find approximation bounds for the 
SDR-based R-SE approach, or, obtain meaningful conditions 
under which the relaxed solution coincides with the unrelaxed 
one. Both problems constitute interesting future directions for 
analytical research, while the ensuing section will demonstrate 
the performance improvement possible with the proposed 
method using numerical tests of practical systems. 

V. Preliminary Simulations 

The novel SDR-based R-SE approach is tested in this sec- 
tion using the IEEE 30-bus system with 41 transmission lines 
ifTTl , and compared to existing WLS methods that are based on 
Gauss-Newton iterations. The software toolbox MATPOWER 
lETl is used to generate the pertinent power flow and meter 
measurements. In addition, its SE function doSE has been 
adapted to realize the WLS Gauss-Newton iterations following 
from ll20l . The iterations terminate either upon convergence, 
or, once the condition number of the approximate linearization 
exceeds 10 8 , which flags divergence of the iterates. To solve 
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Fig. 1. Comparing estimation errors in voltage magnitudes and angles 
between SDR and WLS solvers at different buses. 



the SDR-based R-SE problems, the MATLAB-based optimiza- 
tion package CVX ifTTIl is used, together with the interior-point 
method solver SeDuMi lfl8l . 

The real and reactive power flows along all 41 lines are mea- 
sured, together with voltage magnitudes at 30 buses. AWGN 
corrupts all measurements, with a? equal to 0.02 at power 
meters, and 0.01 at voltage meters. Except for the reference 
bus phasor V re f = 1, each bus has its voltage magnitude 
Gaussian distributed with mean 1 and variance 0.01, and 
its voltage angle uniformly distributed over [— 0.5-7T, 0.57r]. 
The empirical voltage angle and magnitude errors per bus, 
averaged over 500 Monte-Carlo realizations, are plotted in Fig. 
Q] In each realization, one power flow meter measurement is 
randomly chosen as a bad datum, after multiplying the meter 
reading by 1.2. Clearly, the proposed algorithm greatly reduces 
the effects of bad data in the estimation error in voltage phase 
angles (upper), which is a more important SE performance 
metric than the magnitude one. 

VI. Conclusions and Current Research 

For the practical nonlinear AC power system model, unique- 
ness issues and robust state estimation (R-SE) algorithms were 
investigated in this paper, when outliers (bad data and/or mali- 
cious attacks) are present. Using a sparse overcomplete outlier 
model, observability and identifiability issues were quantified 
using the notion of measurement distance for the quadratic 
measurement model. Valuable insights and computable levels 
of outlier observability and identifiability were provided for 
linear approximations of the quadratically nonlinear models. 
A novel SDR-based scheme was also developed by tactfully 
relaxing the nonconvex R-SE problem to a convex SDP one, 
thus rendering it efficiently solvable via existing interior-point 
methods. Preliminary numerical simulations on the 30-bus 
benchmark system demonstrated improved performance of the 
proposed R-SE scheme. 



Further enhancements to the SDR-based R-SE framework 
are currently pursued toward developing more efficient and 
tailored solvers by exploiting sparsity of the SDP problem 
structure. The measurement distance for the power meter 
quadratic functions is also under investigation using insights 
from nonlinear channel coding theory. 
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